Chengquan Zhang

ChartArena: Benchmarking Chart Parsing across Languages, Scenarios, and Formats

May 31, 2026

PhoneWorld: Scaling Phone-Use Agent Environments

May 28, 2026

Chronicles-OCR: A Cross-Temporal Perception Benchmark for the Evolutionary Trajectory of Chinese Characters

May 12, 2026

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe

May 05, 2026

Do Phone-Use Agents Respect Your Privacy?

Apr 02, 2026

MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation

Mar 25, 2026

Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training

Mar 25, 2026

Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity

Mar 11, 2026

Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Jun 11, 2025

Recognition-Synergistic Scene Text Editing

Mar 11, 2025